Biological Domain Identification Based in Codon Usage by Means of Rule and Tree Induction

نویسندگان

  • Antonio Neme
  • Pedro Miramontes
چکیده

There are three domains in living nature: archaea, bacteria and eukarya. It has been shown, trough a number of multivariate tools, that codon usage, a 64 dimensional vector that stablishes how often a given organism makes use of each codon, is related to domain. Another method is proposed here based in rule and tree induction from codon usage of several organisms. It is shown that domain can be identified trough codon usage and a simple set of rules. Two methods were applied, CN2 and C4.5. Obtained rules describe data better than other methods, in the sense that are topological interpretable and have phenomenological meaning.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification of Synonymous Codon Usage Bias in the Pseudorabies Virus UL31 Gene

Background: Little knowledge of synonymous codon usage pattern of pseudorabies virus (PRV) genome, especially the UL31 gene in the process for its evolution is available. Objectives: In the present study, the codon usage bias between PRV UL31 sequence and the UL31-like sequences was identified. Materials and Methods: We used a comprehensive analysi...

متن کامل

Application of the rule extraction method to evaluate seismicity of Iran

Assessing seismic hazards involves specifying the likelihood, magnitude and location of earthquakes in a region. Predicting the seismic hazards is the first step in reducing the impact of the damage caused by an earthquake.  In this study, to fully utilize all the known parameters which may possibly affect the occurrence of earthquakes (mb ≥ 4.5); a data-driven rule-extraction method called the...

متن کامل

Bioinformatics Comparison of Codon Usage of Genes Encoding Phosphate Transporter in Terms of Salt Tolerance, Day Length, Temperature and Pollination in Different Plants

In order to study and compare the phosphate transporter gene codon usage and it's respond to the traits like salt tolerance, day length, Pollination and temperature in different plants, 100 isoform from 10 plants are extracted from NCBI website and then analyzed with Gene Infinity and Minitab 16 software. The result shows that the highest codon usage similarity (81.95%) was for wheat a...

متن کامل

Bioinformatics Comparison of Codon Usage of Genes Encoding Phosphate Transporter in Terms of Salt Tolerance, Day Length, Temperature and Pollination in Different Plants

In order to study and compare the phosphate transporter gene codon usage and it's respond to the traits like salt tolerance, day length, Pollination and temperature in different plants, 100 isoform from 10 plants are extracted from NCBI website and then analyzed with Gene Infinity and Minitab 16 software. The result shows that the highest codon usage similarity (81.95%) was for wheat a...

متن کامل

MMDT: Multi-Objective Memetic Rule Learning from Decision Tree

In this article, a Multi-Objective Memetic Algorithm (MA) for rule learning is proposed. Prediction accuracy and interpretation are two measures that conflict with each other. In this approach, we consider accuracy and interpretation of rules sets. Additionally, individual classifiers face other problems such as huge sizes, high dimensionality and imbalance classes’ distribution data sets. This...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004